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@ Improve d recombinant expreaalon method, vector and transformed cella. 

@ A method for continuous production of a desired heterolo- 
gous protein comprising constructing an expression vector 
having a stabSteing sequence downstream of a promoter and 
upstream of the DMA encoding tha cesired heterologous 
protein, tr ana f act in g and choosing a particular eukaryotic host 
cell for said continuous production and culturing the trans- 
formed eukaryotic host cell under conditions favorable for 
continuous production of said desired heterologous protein. 
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Description 

IMPROVED RECOMBINANT EXPRESSION METHOD, VECTOR AND TRANSFORMED CELLS 

This invention relates to the application of recombinant DNA technology to prepare vectors capable qf 
expressing desired proteins such that continuous production of the protein can be achieved. Furthermore, the 
5 Invention relates to the construction of an expression vector capable of generating stable cytoplasmic mRNA 
so as to give rise to continuous production of the desired protein. In another aspect the invention relates to an 
expression vector having a specific stabilizing sequence positioned 5' to a DNA encoding a desired protein. 
The invention further relates to the transfection of eukaryotic cells with such vectors and choosing of a host 
ceil such that continuous production of the desired protein by that cell line is established. 

10 Recombinant technology has recently been applied to eukaryotic cells, specifically mammalian ceils were 
transformed with heterologous DNA coding for a selectable phenotype. Wigler. <M M et al.. Cell 11: 223-232 
(1977). (t has also been shown that eukaryotic cells can be transformed to yield transformants having 
heterologous DNA Integrated into the chromosomal DNA of the eukaryotic cell nucleus. 
Successful transformation of eukaryotic cell cultures and expression of DNA sequences coding for a 

15 desired protein has been disclosed. See for example, European Patent Publications Nos. 73,659 and 73,666. 
These successful transformations have utilized vectors to express complimentary DNA (cDNA's) requiring 
only 5' control signals such as enhancers (Gluzman, Y and Shenk, T. [eds.] Enhancers and Eukaryotic Gene 
Expression [Cold Spring Harbor Laboratory, 1983]), promoters (Hamer, D. H. et a}., Cell 21, 697 [1980]) and 3' 
polyadenylation sites (Proudfoot, NJ. and Brownlee, Q.G., Nature 263, 211 [1976]). 

30 In 1977 It was found that in eukaryotes the cytoplasmic mRNA is not always co-linear with the DNA. DNA 
sequences encoding proteins were found to be Interrupted by stretches of non-coding DNA. There are long 
stretches of base sequence in the DNA of the gene which do not appear In the final mRNA. It was observed 
that the primary mRNA transcripts were 'spliced' to remove the non-coding sequences, i.e. sequences which 
do not encode a protein. These non-coding sequences in DNA are generally referred to as introns (formerly 

25 referred to as intervening sequences) while the coding sequences are known as exons. RNA polymerase 
makes a primary transcript of the entire DNA, both exons and introns. This transcript was processed so that 
the introns were removed while at the same time the exons were all joined together in the correct order. The 
mechanism producing the foregoing result is referred to as 'splicing.' 
Numerous split or spliced genes have been discovered. In fact, introns exist in virtually alt mammalian and 

30 vertebrate genes and also in the genes of eukaroytlc microorganisms. Introns are not limited to the coding 
region of a message. For example, one intron was found in the leader region of the plasminogen activator 
mRNA before the coding sequence in addition to multiple splice sites elsewhere In the gene. Fisher, R. et aJ M J. 
Biol. Chem. 260, 1122 (1985). There has been considerable speculation about why introns have evolved and 
become suchageneral feature of eukaryotic genes. Crick, F., Science 204. 264, 1979; and, Sharp, P.A., Cell 23, 

35 643-646 (1981). 

Given the ubiquity of introns, tt is not surprising that splicing was studied in the context of recombinant 
technology. For example, an SV40 vector was constructed containing a rabbit P-globin cDNA, regions 
implicated in transcription initiation and termination, splice sites from a multipartite leader sequence located 5' 
to the p-globin cDNA sequence and a polyadenylation sequence. Mulligan, R.C. et a}., Nature 277, 108-114 

40 (1979). This recombinant genome, when Infected into monkey kidney cells, was found to produce hybrid 
mRNAs containing the leader region for the 16S and 19S late RNA and the P-globin coding sequence. This 
hybrid mRNA produced substantial quantities of the rabbit p-globln polypeptide. Mulligan et al. discuss an 
experiment in which mutants lacking splicing capability failed to produce discrete mRNAs. Id. at 109. 
In an attempt to establish the physiological role that RNA splicing plays in gene expression, Hamer, D.H. and 

45 Leder, P., Ceil 18, 1299-1302 (1979) manipulated the location and/or presence of a splice site In SV40 
recombinants transfected into monkey cells. Hamer and Leder. supra, used one splice site located within the 
gene encoding the desired protein or used two splice site sequences, one located 5' to and the second within 
the gene encoding the desired protein. They found that RNA were produced transiently by ail of the viruses 
that retain at least one functional splice junction. They concluded that splicing is a prerequisite for stable RNA 

50 formation. Confirming that result, Gruss, P. et al. PNAS (USA), 76, 4317-4321 (1979) found that construction of 
an SV40 mutant lacking an intervening sequence made no detectable capsid protein. The Gruss paper utilized 
a multipartite leader having seven) splice site sequences. The three papers discussed alt utilize viral vectors 
with numerous splice sites at various locations. These viral vectors differ from the nonviral vectors of the 
instant Invention in several respects. First viral introns are located both 5' and 3* to the transcription unit as 

55 well as within the coding sequence itself. In the instant invention the stabilizing sequence is located 5' to the 
gene encoding the desired protein. Viral vectors continue to replicate independent of the host DNA, do not 
integrate and are lytic. Finally, many viral vectors require early gene function for correct splicing to occur . 

These two papers suggest that RNA splicing may be Important in a recombinant milieu. However, other 
studies abandoned splicing to express proteins using only 5' control signals such as enhancers, and 

60 promoters and 3* polyadenylation sites. In fact, recent work by Reddy, U.B. et al., Transcriptional Control 
Mechanisms, J. Cell. Biochem. Suppl. 10D, 154 (1986), found that the inclusion of Introns In an expression 
vector actually reduced the amount of the desired.protein expressed. The authors concluded that introns were 
not an essential part of vectors for the expression of a desired protein. Hail et aL also observed that including 
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an intron was detrimental to protein expression. It was observed that deletion of the acceptor sequence 
resulted In transient production of unspliced cytoplasmic viral mRNA. Trsismann, R. et al. Nature 292. 595-600 
(1981). These results support the notion that splicing Is not obligatory. 

Straightforward expression using standard recombinant control signals such as enhancers, promoters and 
3' pofyadenylation sites cannot always be achieved. The SV40 promoter without a splice site has been used to s 
direct expression of numerous cDNAs/ (fi-galactosidase, Hall. C.V. et al. J. Mol. Applied Genetics 2,; human 
interferon, Gray, P.W., et ah, Nature 295, 503 (1982); hemagglutinin. Gething, et aj. Nature 293, 620 (1981); 
human lecithin-cholesterol acyltransferase, McLean, J. et al., PNAS 33. 2335 (1986); DHFR, Slmonsen, C.C. et 
al., PNAS 80, 2495 (1983) ; human interteukin-2, Leonard, W.T. et al., Nature 311, 626 (1984); ras-2. Capon, D.J. 
et aJ. Nature 304, 1983; src, Snyder, MA et al., Cell 32, 891 (1983) ; and hepatitis B surface antigen, Crowley, 10 
C.W. et aJ. ( Mol. Cell Biol. 3. 44-55 (1983)). However, no discrete factor Vlll message of correct size was 
detected using an expression vector comprising an SV40 promoter ligated to a cDNA encoding factor Vlll 
transfected into a variety of cells. Transcription of other genes present on the same plasmid. such as DHFR, 
did produce the correct message. Since the SV40 promoter could express mRNA for certain proteins but not 
factor vlll, the problem was identified as relating to either transcription/splicing of a mRNA that would permit 15 
continuous expression or simply a lack of accumulatfoin of the factor vlll message. The former problem is 
referred to herein as one of the 'stability* of the mRNA. 

Numerous experiments using various combinations of transcriptional start signals with the cDNA encoding 
factor Vlll were tried. Cells transfected with such vectors were analyzed for factor Vlll message by Northern 
analysis. No discrete message of the correct size was found. 20 

Experiments were also conducted with introns and splice sites present in the vectors. Okayama, H. and 
Berg. P.. Mol. and Cell. Biol. 3(2) : 280-289 (1983) utilize a plasmid vector, pcD, containing an SV40 early region 
promoter. SV40 late region intron comprising one donor site and two acceptor sites, cDNA and a 
pofyadenylation signal. A vector comprising the adenovirus major late promoter and tripartite leader, having 
three splice sites and a cDNA encoding factor Vlll was constructed as described in European Patent 25 
Publication No. 160,467. This vector was analyzed and found to be randomly successful In directing expression 
of full length factor VIII. This could be explained In part by cryptic spflclng. The tripartite leader region Is spliced 
onto multiple coding regions to yield a final message. The complexity of the splicing pattern is evident from the 
fact that 4 primary transcripts can be differentially spliced to yield 14 discrete messages. Nevins, J. and Wilson, 
M., Nature 290. 113 (1981). The controls for selection of downstream splicing to the coding sequence is not 30 
well understood. However, selection of the appropriate poiyadenylatlon site and transcription termination 
precede the final splicing event and may effect the selection of the 3' splice site. For these reasons and 
because the information content of the base sequences at exon-intron junctions Is relatively small it Is not 
surprising that splicing Is sometimes Incorrect, i.e. cryptic. Hamer et al., Cell 21^, 697-708 (1980) and Mansour 
et al.. Mol. Cell. Biol. 6, 2684 (1986). Cryptic splicing could explain the random success in expressing full length 35 
factor Vlll using the'adenomajor late promoter and tripartite leader. 

Further analyses of vectors containing the adenomajor late promoter was conducted. Adenovectors had 
been used to express other proteins but with a restricted expression pattern suggesting that the adeno 
control regions could function in a limited number of cell types. Levine, A.S. et al.. Virol. 11, 672-681 (1973) and 
Grodzicker, TJ. et al., J. Vlrql. 9, 559-571 (1976). Vectors were constructed using cDNA's from other proteins 40 
such as DHFR or t-PA with theldentical 5' and 3* control regions as described In European Patent Publication 
No. 160,457. Following transfectton of these plasmlds Into Cos. 293, BHK and CHO cells the transfectants were 
monitored for either t-PA expression by immunoperoxidase staining or DHFR expression using methotrexate. 
In summary, at no time were any of these adeno late vectors found capable of expressing t-PA or DHFR In any 
cell types other than 293 or Cos cells. Transient expression of t-PA was reproducibfy seen in 293 or COS cells, 45 
however, factor vlll expression was random under the Identical conditions. These results were confirmed in 
three papers in which the use of a portion of a viral multipartite leader sequence failed to express the desired 
protein. In the first paper, Kaufman, RJ. and Sharp, PA, Mot. Cell. Biol. 2(11), 1304 (1982) constructed a 
vector containing the adenomajor late promoter, Including the first leader and 5' splice donor she of the 
adenovirus tripartite mRNA leader sequence, adjoined to two 3' splice site acceptor sequences Isolated from so 
an immunoglobulin variable-region gene and the DHFR coding region. This vector transfected Into DHFR- 
CHO ceils produced a very tow frequency of DHFR + cells. In a second paper Kaufman, RJ. and Sharp. PA, J. 
Mol. Biol. 159, 601-621 (1982) described the same plasmid and indicated that expression of DHFR was not 
obtained. Id. at 606. Wong, G.C. et a}., Science 228: 610-815 (1985) use an expression vector having: an SV40 
enhancer; the adenovirus major late promoter and tripartite leader sequences; a hybrid Intron consisting of a 55 
5' splice site from the first exon of the tripartite leader and a 3* splice site from a mouse Immunoglobulin gene; 
two cDNAs the first encoding a desired protein, colony stimulating factor, and the second DHFR; SV40 
poiyadenylation sequence ; and. VA gene. This potycistronic vector was found to work only transiently, supra at 
810, required the presence of VA RNA to Increase translatablflty. supra at 811, and required a second cDNA, 
that of DHFR. to Increase mRNA stability, supra at 81 1 . So whlie a restricted transient expression capability 60 
was seen with adenovirus major late vectors which included the entire tripartfte leader for some proteins, 
certain proteins have additional requirements for successful continuous expression. 

A vector was constructed containing a cytomegalovirus promoter and enhancer, a cDNA encoding factor 
Vlll, and a 3 1 terminating sequence, absent any intron or constructed splice site. Neither transient nor stable 

ipresaion of factor Vlll was observed in any of the ceD types tested. 65 
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Another vector absent an Intron or constructed splice site containing the SV40 early transcriptional 
sequences Including the enhancer and promoter, cDNA encoding factor VIII and the SV40 polyadenylation site k . 
produced neither transient nor stable expression. 

A vector containing the SV40 promoter and enhancer, the entire adenomajor late tripartite leader i.e. three 
5 introns with appropriate donor and acceptor sites, cDNA encoding factor VIII and the 3' hepatitis surface 
antigen polyadenylation site produced transient expression of factor VIII only in COS cells but no other cell 
types. 

A vector containing the SV40 enhancer and promoter, the first intron of the adenomajor late tripartite leader, 
an Immunoglobulin (Ig) variable region acceptor site, cDNA encoding factor VIII, and the SV40 potyadenytation 
10 site expressed factor VIII transiently in COS cells but produced no expression in other ceil types. 

A vector was constructed containing an SV40 enhancer and promoter, the first donor site and Intron of the 
adenomajor tripartite leader, the consensus sequence for the Ig variable region acceptor sequence, the cDNA 
encoding factor VIII and the 3' polyadenylation site of the hepatitis surface antigen. This vector failed to provide 
transient or stable expression of express factor VIII. 
15 Yet another vector was constructed comprising the SV40 enhancer and promoter, cDNA encoding factor 
VIII, an SV40 small t-antigen intron 3' to the cDNA, complete with donor and acceptor sequence and the SV40 
early region potyadenytation site. This vector failed to produce either transient or stable expression of factor 
VIII In any cell type. 

Experiments described herein establish that a stabilizing sequence, either a donor-intron-acceptor 
20 sequence or an engineered splice sequence, is necessary for stable expression of certain proteins. Those 
experiments further establish that location of the stabilizing sequence is important for stable continuous 
expression. The present invention is directed to the construction and use of vectors having a specific 
stabilizing sequence positioned 5' to the DNA encoding certain proteins that are difficult to express. The 
expression vector of the instant invention when transf ected into a selected host cell will transfrom that host 
25 cell to one that provides continuous production of a desired protein, e.g. factor Vlll. The Invention is also 
directed to the choice of an appropriate ceil line and transaction of that host cell to establish a cell line for 
continuous production of the desired protein. 

The present invention is based on the discovery that continuous production of some proteins by use of a 
recombinant expression vector requires a particular arrangement of a stabilizing sequence, located 5' to the 
30 DNA encoding the desired protein. Furthermore, the invention relates to a stable cytoplasmic mRNA resulting 
from use of a stabilizing sequence positioned 5' to a DNA encoding a desired protein. In another aspect the 
Invention is directed to the expression vectors constructed in accord with the foregoing which express the 
gene encoding the desired heterologous protein. 
In still another aspect the invention relates to the choice of an appropriate host cell for transfectlon with the 
35 novel vector of the instant Invention. Yet another aspect of the instant invention is the transformation of a host 
ceil to establish a stable cell line for production of the desired heterologous protein. 

Figure 1 Construction of a factor VIII expression vector used to establish production cell lines for factor ? 
Vlli.pF8CIS. 

Figure 2 Construction of a factor VIII expression vector used to establish production cell lines for factor 

40 via. pFescis. 

Figure 3 Immunoperoxidase staining of cells following transfection (A) shows expression following 
transfection with pFBCIS (B) shows expression following transfectton with pFSSCIS. 

Figure 4 immunoperoxidase staining of CHO cells transfected with pFBCIS subject to one round of 
amplification. 

45 Figure 5 Immunoperoxidase staining of CHO cells transfected with pF8CtS and subjected to three 

rounds of amplification. 

Figure 6 Construction of a factor VIII variant expression vector used to establish production celt lines 
for the factor VII I variant pF8CIS9060. 

Figure 7 Immunoperoxidase staining of the cells transfected with the vector pFBCIS9080 encoding the 
50 factor Vlll variant or fusion protein. 

Figure 8 Immunoperoxidase staining of CHO cells transfected with pF8CIS subjected to continuous 
amplification. 

Figure 9 Construction of an expression vector containing a cDNA encoding factor Vlll resistant to 
proteolytic cleavage by activated protein C. pF8CIS-336E. 
55 Figure 10 Construction of an expression vector containing a cDNA encoding a fusion protein of factor 

Vlll resistant to proteolytic cleavage by activated protein C. pF8908O336E. 

Figure 11 SDS-PAGE and Western blot analysis of purified 90kd + 142aa + 80kd fusion. 
Approximately 8 fig of the 90kd + 142aa + 80kd fusion was resolved by SDS-PAGE. Subsequently the 
protein was detected by staining with Coomassie blue (A) or transferred to nitrocellulose for Western blot 
60 analysis (B) . A rabbit polyclonal antibody raised against plasma derived factor Vlll was used to detect the 

90kd + 142aa + 80kd fusion bound to nitrocellulose. 

Figure 12 Thrombin activation of the 90kd + 142aa + 80kd fusion. Approximately 1 1 ug of the purified 
90kd + 142aa + 80kd fusion in 0.05 M Tris. pH 7.4, containing 0.15 M NaCl, 2.5 mM CaCt 2 and 5 percent 
glycerol was incubated with 55 ng of thrombin for 0.1 to 60 minutes at 37*C. At the times indicated an * 
65 aliquot was removed, diluted 1/2000-1/10,000 fold in 0.05 M Tris, pH 7.4 containing 0.01 percent BSA and 
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assayed by coagulation analysis. SDS buffer was added to the remainder of the sample which was heated 
to 90°C for 5 min. then subjected to SDS-PAGE (inset). 

Figure 13 Binding of 90kd + 142aa + 80kd fusion to vWF is shown. The 90kd + 142aa + 80kd fusion 
(275 units) in 0.05 M Tris, pH 7.4. 150 nM NaCI 2 . 2.5 mM CaCIa and 5 percent glycerol was passed through 
a vWF-Sepharose column and the column was subsequently washed with three column volumes of the 
above buffer. The 90kd + 142aa + SOkd fusion was eluted with 0.25 M CaCI 2 . The vWF-Sepharose 
column was prepared by coupling pure WVF to Afflgel 10 (Bio Rad) according to the manufacturers 
specifications. 

Figure 14 Construction of a prorelaxin expression vector used to establish production cell tines for 
prorelaxin. pCIHRX. 

Figure 15 Construction of a prorelaxin expression vector used to establish production cell lines for 
prorelaxin. pCISRX. 

Figure 16 Construction of a t-PA expression vector used to establish production cell lines for t-PA. 
DCIHt-PA. 

Figure 17 Sequence of a portion of pFSCIS. The DNA sequence of the expression vector containing the 15 
cytomegalovirus enhancer, promoter (nucleotides 1-732), stabilizing sequence, I.e. splice donor intron 
sequence the Ig variable region intron and splice acceptor sequence (nucleotides 733-900). 

Figure 18 Sequence of a portion of pF8SCIS. The DNA sequence of the expression vector containing 
the SV40 enhancer and promoter, (nucleotides 1-360) stabilizing sequence which includes cytomegalovi- 
rus donor and intron sequence, the Ig variable region intron and splice acceptor sequence (nucleotides 20 
361-580). 

Figure 19 Sequence of a portion of pF8CSSS. The DNA sequence of the expression vector containing 
the cytomegalovirus enhancer promoter and leader (nucleotides 1-732), stabilizing sequence Including 
the engineered splice donor and acceptor sequence (nucleotides 733-736), the remaining leader. 

Figure 20 Constructions of a t-PA expression vector used to establish production cell lines for t-PA. 25 
DCISt-PA. 

As used herein 'nucleotide sequence' refers to a nucleic acid comprising a series of nucleotides in a 5' to 
3' ohosphate diester linkage which may be either an RNA or a DNA sequence. If a DNA, the nucleotide 
sequence may be either single or double stranded. Similarly, "DNA sequence* refers to both single and double 

stranded embodiments. ^ ^ „ w ^ u 

•Desired heterologous protein' refers to a protein which is desired to be expressed in a host cell, but which 
the host cell either normally does not produce itself or produces in small amounts, and which is not normally 
necessary for the cells continued existence. Such a protein includes any molecule having the pre or mature 
amino acid sequence and amino acid or gtycosytatlon variants (including alleles) capable of exhibiting a 
biological activity In common with said desired heterologous protein. 

•Splicing* refers to the mechanism by which a single functional RNA molecule is produced by the removal of 
one or more internal stretches of RNA during the processing of the primary transcript. Splicing is believed to 
begin with the looping out of the intron so that the 5' end of the intron (referred to as the donor) is juxtaposed 
to the 3' end of the intron (referred to as the acceptor). A comparison of the base sequences at intron-exon 
junctions reveals consensus sequences, with the first two bases at the 5' end of each intron being GT and the 40 
last two bases at the 3' end being AG. 

■Spliced mRNA" refers herein to mRNA produced by either the removal of one or more Internal stretches of 
RNA or by constructing a DNA which when transcribed produces a mRNA having the same properties as a 
mRNA which had been subject to splicing but from which no nucleotide sequence had In fact been removed. 

•Stabilizing sequence' refers to a DNA sequence that gives rise to a spliced mRNA by coding either a splice 45 
donor-mtron-acceptor sequence or by coding a sequence comprising a full consensus sequence or a part 
thereof tor the donor and acceptor sequence and the appropriate nucleotides at the donor/acceptor junction 
such that the resulting mRNA resembles functionally a mRNA which had been spliced. The stabilizing 
sequence Is placed in the leader sequence of the gene encoding the desired heterologous protein. Leader 
sequence" refers to that region of mRNA that is in the 5' untranslated region between the CAP site and the 

AU -rL^sus^^ £AG/GTft AGT found to occur at the exon-intron 

boundary (or donor sequence) and (I )„N $ AG/G found to occur at the intron-exon bounoary(or acceptor 
sequence). See Mount. S.M., Nucleic Acids Research 10(2). 459-172 (1982). Arises of the frequency wfth 
which individual bases occur in particular positions yielded a consensus se ^enceta me do ™ ^ a ^Pl° r 56 
sequences, it is also known that introns begin with GT and end with AG. Breathnach. R. etaJ. PNAS (USA) 75. 
4853-4857 (1978). It is also known that certain multipartite leader sequences in which multiple spacing events 
occur may require additional factors of early gene function to achieve proper processing. See BaWss. LE. et 
al Mol and Cefl. Biol. 5(10). 2552-2558 (1985). One of ordlnaryskin In the art using the knowledge of the donor 
and acceptor consens-us sequences, multipartite leader sequences m wWc !V^^ 
requiring early gene function and the consensus splice sequences rule In accord with the instant invention will 
be able to select a particular stabilizing sequence for a desired protein. 

•Control region' refers to specific sequences at the 5' and 3' ends of eukiryotic genes which may be 
involved in the control of either transcription or translation. Virtually all eukaryotte genes ^^^chre^on 
located approximately 25 to 30 bases upstream from the site where transcription is Initiated. Another 
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sequence found 70 to 80 bases upstream from the start of transcription is a CXCAAT region where X may be 
any nucleotide. At the 3' end of most eukaryotic genes is an AATAAA sequence which may be the signal for 
addition of the polyadenylatlon tail to the 3' end of the transcribed mRNA. 
"Promoter* refers to the nucleotide segment recognized by RNA polymerase molecules that start RNA 
5 synthesis. Promoters controlling transcription from vectors in mammalian host ceils may be obtained from 
various sources for example, the genomes of viruses such as: polyoma, Simian Virus 40 (SV40), adenovirus, 
retroviruses. hepatttis-B virus and most preferably cytomegalovirus, or from heterologous mammalian 
promoters, e.g. beta actin promoter. The early and late promoters of the SV40 virus are conveniently obtained 
as an SV40 restriction fragment which also contains the SV40 viral origin of replication. Fiers et a}., 1978, 

w 'Nature", 273: 113. The Immediate early promoter of the human cytomegalovirus is conveniently obtained as a 
Hindlll E restriction fragment. Greenaway, P J. et aj„ Gene 18, 356-3S0 (1982). Of course, promoters from the 
host cell or related species also are useful herein. 

"Enhancer" refers to cis-acting elements of DNA, usually about from 10-300 bp, that act on a promoter to 
increase Its transcription. Transcription of a DNA encoding a desired heterologous protein by higher 

15 eukaryotes is Increased by Inserting an enhancer sequence into the vector. Enhancers are relatively 
orientation and position independent having been found 5' (Laimins, L et al. f PNAS 78, 993 [1981]) and 3' 
(Lusky, M.L, et ah, Mol. Ceil Bio. 3, 1108 [1983]) to the transcription unit, wtthin an Intron (Banerji, J.L et a!., 
Cei! 33, 729 [1983]) as well as within the coding sequence Itself (Osborne, T.F., et aJ., Mol. Cell Bio. 4,~1293 
[1984]). Many enhancer sequences are now known from mammalian genes (globln, elastase, albumin. 

20 a-fetoprotein and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. Examples 
Include the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early 
promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human or nucleated 
ceils from other multicellular organisms) will also contain sequences necessary for the termination of 

25 transcription which may affect mRNA expression. These regions are transcribed as polyadenylated segments 
In the untranslated portion of the mRNA encoding the desired heterologous protein. The 3' untranslated 
regions also include transcription termination sites. 

Expression vectors may contain a selection gene, also termed a selectable marker. A selection gene 
encodes a protein, sometimes referred to as a secondary protein, necessary for the survival or growth of a 

30 host cell transformed with the vector. Examples of suitable selectable markers for mammalian cells are 
dihydrofblate reductase (DHFR), thymidine kinase or neomycin. When such selectable markers are 
successfully transferred into a mammalian host cell, the transformed mammalian host cell can survive if placed 
under selective pressure. There are two widely used distinct categories of selective regimes. The first category 
is based on a cell's metabolism and the use of a mutant cell line which lacks the ability to grow independent of 

35 a supplemented media. Two examples are: CHO DHFR- cells and mouse LTK- cells. These cells lack the 
ability to grow without the addition of such nutrients as thymidine or hypoxanthine. Because these cells lack 
certain genes necessary for a complete nucleotide synthesis pathway, they cannot survive unless the missing , 
nucleotides are provided in a supplemented media. An alternative to supplementing the media is to introduce 
an intact DHFR or TK gene into cells lacking the respective genes, thus altering their growth requirements. 

40 Individual cells which were not transformed with the DHFR or TK gene will not be capable of survival in 
non-supplemented media. Therefore, direct selection of those cells requires cell growth in the absence of 
supplemental nutrients. 

The second category is dominant selection which refers to a selection scheme used In any cell type and 
does not require the use of a mutant cell line. These schemes typically use a drug to arrest growth of a host 

45 cell. Those cells which have a novel gene would express a protein conveying drug resistance and would 
survive the selection. Examples of such dominant selection use the drugs neomycin, Southern P. and Berg. P., 
J. Molec. Appl. Genet 1, 327 (1982). mycophenolic acid, Mulligan. R.C. and Berg, P. Science 209, 1422 (1980) 
or hygrornycin. Sugden, B. et aJ. f Mol. Cell. Biol. 5:410-413(1985). The three examples grvenlbove employ 
bacterial genes under eukaryotic control to convey resistance to the appropriate drug neomycin (G418 or 

50 geneticin), xgpt (mycophenolic acid) or hygrornycin, respectively. In the following experiments the selective 
agent of choice Is most often Q41 8 geneticin unless specifically referring to CHO DHFR - cells. In this case the 
direct selection for DHFR production was used. 

"Amplification* refers to the increase or replication of an isolated region within a cell's chromosomal DNA. 
Amplification is achieved using a selection agent e.g. methotrexate (M7X) which Inactivates DHFR. 

55 Amplification or the making of successive copies of the DHFR gene results in greater amounts of DHFR being 
produced in the face of greater amounts of MTX. Amplification pressure Is applied notwithstanding the 
presence of endogenous DHFR, by adding ever greater MTX to the media. Amplification of a desired gene can 
be achieved by cotransfectlng a mammalian host cell with a plasmld having a DNA encoding a desired protein 
and the DHFR or amplification gene so that cointegratton can occur. One ensures that the cell requires more 

60 DHFR, which requirement Is met by replication of the selection gene, by selecting only for cells that can grow 
in successive rounds of ever-greater MTX concentration. So long as the gene encoding a desired 
heterologous protein has colntegrated with the amplrfiabie gene, replication of this gene gives rise to 
replication of the gene encoding the desired protein. The result Is that increased copies of the gene, i.e. an 
amplified gene, encoding the desired heterologous protein express more of the desired heterologous protein. * 

65 Preferred suitable host ceils for expressing the vectors of the Instant Invention encoding the desired 
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heterologous proteins In higher eukaryotes include: monkey Wdney CVI line transformed by SV40 (COS-7, 
ATCC CRL 1651); human embryonic kidney line (293, Graham, F.L et al. J. Gen Virol. 36. 59 [1977]); baby 
hamster kidney cells (BHK, ATCC CCL 10); Chinese hamster ovary-cells-OHFR (described by Urlaub and 
Chasin. PNAS (USA) 77, 4216, [1980]); mouse Sertoli cells (TM4, Mather, J.P., Biol. Reprod. 23. 243-251 
[1980]); monkey kidney cells (CVI ATCC CCL 70); african green monkey kidney cells (VERO-76, ATCC 5 
CRL- 1587) ; human cervical carcinoma cells (HELA, ATCC CCL2); canine kidney cells (MOCK, ATCC CCL 34); 
buffalo rat liver cells (BRL 3A, ATCC CRL 1442): human lung cells (W138. ATCC CCL 75); human liver cells 
(Hep G2. HB 8065); mouse mammary tumor (MMT 060562, ATCC CCL51): rat hepatoma cells (HTC, M1.54, 
Baumann, H. et al.. J. Cell Biol. 85. 1-8 [1980]): and. TRI cells (Mather, J.P. et al.. Annals N.Y. Acad. Sci. 383 , 
44-68 [1982]). " W 

"Transformation" means introducing DNA into an organism so that the DNA Is repltoable. either as an 
extrachromosomal element or by chromosomal Integration. Unless otherwise provided, the method used 
herein for transformation of the host : is is the method of Graham, F. and van der Eb, A., Virology 52, 456-457 
(1973). 

Host cells may be transformed with the expression vectors of the instant Invention and cultured In 15 
conventional nutrient media modified as Is appropriate for inducing promoters, selecting transformant3 or 
amplifying genes. The culture conditions, such as temperature, pH and the like, are those previously used wtth 
the host cell selected for expression, and will be apparent to the ordinarily skilled artisan. 

Transf action" refers to the taking up of an expression vector by a host cell whether or not any coding 
sequences are in fact expressed. Numerous method of transfection are known to the ordinarily skilled artisan, 20 
for example, CaPCU and electroporation. Successful transfection is generally recognized when any Indication 
of the operation of this vector occurs wtthin the host cell. However, In the context of the present Invention 
successful transfection refers to stable continuous expression of a desired heterologous protein by a host 
culture over numerous generations. 

Choosing of the host production cell Is achieved by screening for transient expression and then unamplrfed 25 
expression using the method of the instant invention. Vectors were screened for transient expression to 
determine which vectors could be used to express a desired heterologous protein. Transient expression 
provides an Indication of whether the particular plasmid that has been taken up functions, i.e., is transcribed 
and translated to produce the desired protein. During this time the plasmid DNA which has entered the cell is 
transferred to the nucleus. The DNA is in a nonintegrated state, free wtthin the nucleus. Transcription of the 30 
plasmid taken up by the cell occurs during this period. Vectors which were Identified as capable of producing 
the desired heterologous protein transiently were then used to establish a stable continuous production cell. 
Transient expression refers to a short period (12-72 hrs) following transfection. Following this Initial period 
after transfection the plasmid DNA becomes degraded or diluted by cell division. Random integration within 
the cell chromatin occurs. Screening the cells after two to three weeks of unamplifled expression is an indicia 35 
of cells which have retained the recombinant DNA leading to a permanent cell line. 

An assay based on immunoperoxidase staining of a transfected cell was developed to assess quickly 
whether a desired heterologous protein had been expressed. (Gorman, CM. et al., Cell 42, 519-522 [1985]). 
Monoclonal antibodies specific for the desired heterologous protein were screened for use in this assay. Host 
ceils containing the vector were stained and compared to parental ceil line for screening cells which produce a 40 
specific protein. A monoclonal antibody was selected which gave the strongest signal with the least amount of 
background. Transient transfectlons were performed to test vectors for the ability to produce a desired 
protein. Ceils (Cos, 293. CHO. BHK, TM4) were transfected using the CaP04 technique. (Graham and van der 
Eb modified by Gorman, CM. et al.. Science 221, 551-553 (1983)). We used ten micrograms per milliliter of 
precipitate of the specific protein vector to be tested. The precipitates were left on the cells for 3-4 hours/Cells 45 
were then glycerol shocked for an average of one minute. Thirty-six hours after transfection cells were fixed 
with acetone-methanol (50:50) and washed with phosphate buffer saline (PBS). Staining was performed using 
either a monoclonal antibody supernatant undiluted or purified antibody diluted 1 :3000 In PBS containing 10Ato 
fetal calf serum. This first antibody remained on the cells 2 hours. Plates were placed on a slow shaker during 
this time. Cells were washed 5 times over a ten minute period. The second antibody used was rabbit 50 
anti-mouse IgG (Dakopatts). This was diluted in PBS + fetal calf serum at a dilution of 1:150. A two hour 
incubation was followed by another series of washes. To develop the peroxidase reagent orthodtansidlne was 
used as a substrate. An ethanol saturated solution of ortho-dlansidine was diluted 1 : 100 in PBS with 1 : 10,000 
dilution of hydrogen peroxide. This substrate was left on the cells for 2 hrs at room temperature or overnight at 
4°C 55 

By this method a wide variety of vectors encoding the desired protein were quicWy screened for the ability to 
direct protein expression. 

Coatest Factor VIII was purchased from Helena Laboratories, Beaumont TX (Cat No. 5293). The procedure 
used was essentially that provided by the manufacturer for the "end point method" for samples containing less 
than five percent protein. 60 

Production ceO lines were established using plasmlds of the instant invention which were shown to function 
transiently in a wide variety of cells. Expression vectors were transfected Into a number of ceil lines. For these 
transf ections a total of 10 ug of DNA/ml precipitate were used. Selection for expression was made possible In 
these cells using a selectable marker as described above. All cells were transfected with modified CaPA4 
technique except BRL cells which were found to be sensitive to calcium. Electroporation was used wtth these 65 
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cells. Transfected cells were selected from suitable host cells as previously described. 

The protocol used to establish production cell lines relied heavily on the staining method described above. 
Two days following transfection, cells were subcultured into a selection media. Media was titrated for the 
proper amount of the particular substance needed for selection. At the same time that cells were transfected 
5 to establish a production ceil line, a dish of each cell type was assayed for transient expression. 

In order to simplify the following examples certain frequently occurring methods and/ or terms win be* 
described. 

'Plasmlds' are designated by a lower case p preceded and/or followed by capita) letters and/or numbers. 
The starting plasmids herein are either commercially available, publicly available on an unrestricted basis, or 

10 can be constructed from available plasmids in accord with published procedures. In addition, equivalent 
plasmids to those described are known in the art and will be apparent to the ordinarily skilled artisan. 

•Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction enzyme that acts only at certain 
sequences, restriction sites, in the DNA. The various ; istrlction enzymes used herein are commercially 
available and their reaction conditions, cofactors and other requirements were used as would be known to the 

15 ordinarily skilled artisan. For analytical purposes, typically 1 us of piasmid or DNA fragment is used with 
about 2 units of enzyme in about 2 uJ of buffer solution. For the purpose of Isolating DNA fragments for piasmid 
construction, typically 5 to 10ug of DNA would be digested with 20 to 40 units of enzyme In a larger volume. 
Appropriate buffers and substrate amounts for particular restriction enzymes are specified by the 
manufacturer. Incubation times of about one hour at 37° C are ordinarily used, but may vary in accordance with 

20 the supplier's instructions. After digestion the reaction was run directly on a gel to isolate the desired 
fragment. 

■Dephosphorytation" refers to the removal of the terminal 5' phosphates by treatment with bacterial alkaline 
phosphatase (BAP). This procedure prevents the two restriction cleaved ends of a DNA fragment from 
'circularizing" or forming a closed loop that would impede insertion of another DNA fragment at the restriction 

25 site. Procedures and reagents for dephosphoryiation are conventional. Maniatis, T. et a}., 1982, Molecular 
Cloning pp. 133-134. Reactions using BAP are carried out In 50mM Tris at 68°C to suppress the activity of any 
exonucleases which may be present In the enzyme preparations. Reactions were run for one hour. Following 
the reaction the DNA fragment Is gel purified. 
•Oligonucleotides" refers to short length single or double stranded polydeoxynucleotides which are 

30 chemically synthesized by known methods and then purified on polyacryiamide gels. 

•Ligation" refers to the process of forming phosphodiester bonds between two double stranded nucleic 
acid fragments {Maniatis, T. et a}., ]d M p. 146). Unless otherwise provided, ligation may be accomplished using 
known buffers and conditions wtth 10 units of T4 DNA ligase ("ligase") per 0.5 u.g of approximately equimolar 
amounts of DNA fragments to be ligated. 

35 "Filling' or "blunting" refers to the procedures by which the single stranded end in the cohesive terminus of 
a restriction enzyme-cleaved nucleic acid is converted to a double strand. This eliminates the cohesive 
terminus and forms a blunt end. This process Is a versatile tool for converting a restriction cut end that may be 
cohesive with the ends created by only one or a few other restriction enzymes into a terminus compatible with 
any blunt-cutting restriction endonuclease or other filled cohesive terminus. Typically, blunting is 

40 accomplished by incubating 2-15ug of the target DNA in 10mM MgCte, 1mM dfthiothreitol. 50mM NaCI, 10mM 
Tris (pH 7.5) buffer at about 37° C in the presence of 8 units of the Klenow fragment of DNA polymerase I and 
250 jxM of each of the four deoxynucleoside triphosphates. The Incubation generally is terminated after 30 min. 
phenol and chloroform extraction and ethanol precipitation. 
"Northern" blotting is a method by which the presence of a cellular mRNA is confirmed by hybridization to a 

45 known, labelled oligonucleotide or DNA fragment. For the purposes herein, unless otherwise provided, 
Northern analysis shall mean electrophoretic separation of the mRNA on 1 percent agarose in the presence of 
a denaturant (formaldehyde 74b), transfer to nitrocellulose hybridization to the labelled fragment as described 
by Maniatis. T. et ah. Id., p. 202. 
The following examples merely Illustrate the best mode now known for practicing the invention, but should 

SO not be construed to limit the invention. All literature citations herein are expressly Incorporated by reference. 

Example 1 
Expression Vector 

55 

Factor VIII 

1. Construction of Expression Vectors 
The cDNA encoding human factor Vlli was used in the construction of plasmids which would direct the 
60 expression of factor VIII protein in transfected mammalian cells (Wood. W, et aJ., Nature [Lond.] 312:330-337 
[19841). Those transformed mammalian cells secreted approximately .14 mil/ml of factor VIII. The Instant 
method provides continuous production of factor VII) with yields significantly greater. 
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a) pF8CIS 

The vector pF8C!S containing the cytomegaJovirus enhancer (Boshart, M. et a)., Cell 41, 520 [1985]) and 
promoter (Thomson, D.R. et aJ M PNAS (M , 659-663 [1 984]), the cytomegaJovirus splice donor site and a portion 
of an intron (Sternberg. R.M. et a). T. of Virol .49, 190-199 [1984]). the Ig variable region intron and splice 
acceptor site, the cDNA encoding factor Vlil and the SV40 polyadenylation site was constructed. s 

Figure 1 shows the steps for construction of the factor Vtll expression vector used to establish production 
cell lines for factor VIII. The three parts of the construction are detailed below. 

1) The ampicillin resistance marker and replication origin of the final vector was derived from the starting 
plasmid pUC13pML a variant of the plasmid pML (Lusky, M. and Botchen, M., Nature 293. 79 [1981]). 
pUC13pML was constructed by transferring the polylinker of pUC13 (Veira, J. and Messing, J., Gene W 
19:259(1982)) to the EcoRI and Hindlll sites of pML. A second starting plasmid p(JC8CMV was the source of 

the CMV enhancer, promoter and splice donor sequence. pUCSCMV was constructed by inserting 
nucleotides 1 through 732. shown in Figure 17, for the CMV enhancer, promoter and splice donor sequence 
into the blunted Pat! and Sphl sites of pUC8. Veira. J. and Messing.. J. supra. Synthetic Bam Hl-Hlndltl linkers 
(commercially available from New England Biolabs) were (Igated to the cohesive Bam HI end creating a Hindlll 15 
site. Following this ligation a HlndlH-HIncll digest was performed. This digest yielded a fragment of 
approximately 800bp which contained the CMV enhancer, promoter and splice donor site. Following gel 
isolation this 800bp fragment was iigated to a 2900bp piece of pUC13pML The fragment required for the 
construction of pFSCIS was obtained by digestion of the above intermediate plasmid with Sail and Hindlll. This 
3123bp piece contained the resistance marker for ampicillin, the origin of replication from pUC13pML and the 20 
control sequences for the CMV Including the enhancer, promoter and splice donor site. 

2) The Ig variable region intron and splice acceptor sequence was constructed using a synthetic oligomer as 
shown in the central portion of Figure 1. A 99 mer and a 30 mer were chemically synthesized having the 
following sequence for the IgG intron and splice acceptor site (Bothwell et a}., 1981); 

25 

1 ' ' ACTACCAACCTTGACCTGTCGCACGCTTGA . . . 
31 GATCTCCCCATACACTTGACTGACAATCA. . . 

60 CATCC^CmCCCTTTCTCTCCACACCT... 30 

88 GTCCACTCCCAC 3 ' 

1 3 ' CACCTGACCCTCCACCTTCACCTCGTCCCA 5 ' 35 



DNA polymerase I (Klenow fragment) filled En the synthetic piece and created a double stranded fragment. 
Wartell, R.M. and W.S. Raznlkoff, Qene 9, 307 (1980). This was followed by a double digest of Pstl and Hindlll. 
This synthetic linker was cloned into pUC13 (Veira, J. and Messing, J., Gene 19, 259 [1982]) at the Pstl and 
Hindlll sites. The clone containing the synthetic oligonucleotide, labelled pUCIg.10, was digested with Pstl. A 
Cta l site was added to this fragment by use of a Pstl-Oal linker. Following digestion with Hindlll a 1 18bp piece 
containing part of the Ig intron and the tg variable region splice acceptor was gel Isolated. 

3) The third part of the construction scheme replaced the hepatitis surface antigen 3 / end with the 
polyadenylation site and transcription termination site of the early region of SV40. A vector, pUC.SV40 
containing the SV40 sequences was inserted into pUC8 at the Bam HI site described In VIera, J. and Messing, 
j., supra. pUC.SV40 was then digested with EcoRI and Hpal. A 143bp fragment containing only the SV40 
polyadenylation site was gei isolated from this digest Two additional fragments were gel isolated following 
digestion of pSVE.8c1D. European Patent Publication No. 150,457. The 4.8 kb fragment generated by Eco RI 
and Ctal digest contains the SV40-DHFR transcription unit, the origin of replication of pML and the ampicillin 
resistance marker. The 7.5 kb fragment produced following digestion wtth Cjal and Hpa l contains the cDNA for 
factor VIII. A three-part ligation yields pSVE.8c24D. This intermediate plasmid was digested by Clal and Sail to 
give a 9611 bp fragment containing the cONA for factor VIII with an SV40 polyadenylation and transcription 
termination sites foltowed by the SV40 DHFR transcription unit 

The final three part ligation to yield pF8CIS used: a) the 3 123 bp San Hindlll fragment containing origin of 
replication, the ampicillin resistance marker and the CMV enhancer, promoter and splice donor; b) The 118bp 
HindllKCtol fragment containing the Ig intron and splice acceptor; and. c) a 9611 bp Cjal-Safl fragment 
containing the cDNA for factor Vill, SV40 polyadenylation site and the SV40 DHFR transcription unit. A portion 
of the sequence of the expression vector pF8CIS Is shown In Figure 17. 



b) pFSCSSS 

The vector pF8CSSS containing the cytomegalovirus enhancer and promoter, an engineered stabilizing 
sequence, the cDNA encoding factor VIII and the SV40 polyadenylation site was constructed. The entire intron 
region Including donor and acceptor sequences was deleted and replaced by an engineered stabilizing 
sequence. The stabilizing sequence is a synthetic double stranded oligomer having a sequence of the mature 
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mRN A following splicing. The stabilizing sequence was inserted between the unique Sacll-Ctal sites of pF8CIS. 
The sequences of the synthetic oligomers are as follows: 

Sacll 

5 ' GGCCGCCAACGGTGATTGGAACCCG 
3 ' CGCCCGCCCTTCCCACTAACCTTGCGC 

5 ' GATTCCCCG7GCCAAGAGTGACSSIGT 
CTAACCGCCACGCTTCTCACTG££ACA 

5'CCACTCCuAC GTCCAACTCC 
CGTGAGGGTG CAGCTTGACG 

15 5 ' ACCTCCCCTTCGAAT3 9 

TCGACCCCAAGCTTAGC5 ' 
£1*1 

20 The synthetic oligomers comprise the appropriate nucleotides of the donor and acceptor consensus splice 
sequences. The juxtaposition of the splice donor sequence to the splice acceptor sequence is Indicated by the 
underline. This vector resembles the pF8CtS vector discussed above except for the deletion of the intron 
portion and replacement wtth an engineered stabilizing sequence. This construction eliminates the actual 
splicing of the noncoding region from recently the transcribed mRNA. A portion of the sequence of the 

25 expression vector pF8CSSS containing the engineered stabilizing sequence is shown in Figure 19. 

c) pF8SC!S 

The vector pFSSCIS containing the SV40 enhancer and promoter, the cytomegalovirus splice donor site and 
a portion of the intron, the Ig intron and splice acceptor site, the cDNA encoding factor VIII and the SV40 
30 polyadenylation and transcription termination sites were constructed. 
Figure 2 shows the construction of pF8SCIS. 

This vector was constructed using a three part ligation. The preparation of each of the three fragment of 
DNA used tn this ligation is described below: 
The first fragment contained the SV40 early region promoter and enhancer and one half the ampicillin 
35 resistance marker which was obtained from plasmid pML The starting plasmid for the first of three fragments 
was pAML3P.8CI. European Patent Publication No. 160,457. This plasmid was cut with Sac l. Using the whole 
enzyme DNA polymerase I this 3' overhang created by Sac l was blunted. Following this reaction the plasmid 
was cut with Pvu l. The desired 434bp fragment was Isolated from an acryiamide gel. 

The second and third fragments used in this construction were isolated from the plasmid pF8ClS which is 
described above. 

Fragment 2 contained the splice donor from CMV immediate eariy gene and part of the following intron and 
the intron and splice acceptor synthetically made as described above. pF8CIS was cut with Sacll and the 
resulting 3' overhang was blunted by the use of DNA polymerase I. This reaction was followed by cleavage with 
Clal . Since the sequence surrounding the Clal site In pF8CIS prevents cleavage if the plasmid is grown in a 
methylation plus strain, pF8CIS was prepared from dam- strain GM48. Marinus, M'.G. and Maris, N.R., 
Bacterid. 114, 1143-1150 (1973) and Geier, G.E. and Madrid, P., J. Biol. Chem. 254, 1408-1413 (1979). Since 
both Sad l and Clal are unique sites tn this vector the 231 bp fragment was easily isolated from an agarose gel. 

The third fragment contains the cDNA for factor VIII, SV40 eariy region polyadenylation site, a SV40-DHFR 
transcriptional unit, the origin of replication of pML and half of the ampicillin gene. The 1 1308 bp fragment was 
prepared by digestion of pFSCiS (dam-) wtth Oal and Pvu l. 

The three part ligation cresting pFBSCIS destroys the Sacl and Sacll sites, maintains the Clal site and 
reconstructs the amprgene at the Pvu l site. A portion of the nucleotide sequence of the expression vector 
pFSSCtS is shown in Figure 18. 



40 



45 



50 



55 



60 



65 



Example 2 

Analysis of Expression 
1. Transient Expression 

Factor VIII expression was assayed based on Immunoperoxidase staining of transfected cells. Gorman etal. 
Ceil 42, 519-526 (1985). This assay was used to test vectors for the expression of factor VIII. Twelve 
monoclonal antibodies specific for factor VIII were screened for use In this assay. BHK 31A3B cells (European 
Patent Publication No. 160,457) were stained and compared with parental BHK line to screen cells which 
produce factor VIIL Monoclonal antibody BH6 was found to give the strongest signal with the least amount of 
background. Transf actions were performed and transient expression of factor VIII was assessed. Cells (Cos, 
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293. CHO, BHK. TM4) were transf ected using the CaPO* technique. Ten micrograms per milliliter of factor VIII 
vector precipitate was tested. The precipitates were left on the cells for 3-4 hours. Ceils were then glycerol 
shocked for an average of 1 minute. Thirty-six hours after transfection cells were fixed with acetone-methanol 
(50:50) and washed with phosphate buffer saline (PBS). Cells were stained using either BH6 supernatant 
undiluted or purified BH6 antibody diluted 1 :3000 in PBS containing 10Qfo fetal caff serum. This first antibody 5 
remained on the cells for 2 hours. Plates were placed on a slow shaker during this time. Cells were washed 5 
times over a ten minute period. A second antibody of rabbit anti-mouse IgG (Dakopatts) was diluted in PBS + 
fetal calf serum at a dilution of 1:150. A two hour incubation was followed by another series of washes. 
Ortho-diansidine (Sigma) was used as a substrate for developing the peroxidase reagent. A ethanol saturated 
solution of ortho-diansidine was diluted 1:100 in PBS with 1:10.000 dilution of hydrogen peroxide. This w 
substrate was left on the cells for 2 hrs at room temperature or overnight at 4°C. 

This method provided a screen for those factor VIII vectors directing factor VIII expression. This method 
determines transient factor VIII exp -esston. Staining thirty-six hours after transfection provides an indication of 
whether the vector was transcribed and the mRNA translated. 

pF8CIS directed transient expression of factor VHI In at least five different cell lines: COS, 293. CHO, TM4 is 
and BHK. Figure 3A shows transient expression of the vector pF&CIS in CHO cells. 

pF8SClS was found to direct transient expression of factor VIII as efficiently as pF8CIS. Figure 3B shows 
transient expression of the vector pFBSCIS in CHO cells. Since the CMV enhancer and promoter can be 
completely replaced by the analogous SV40 enhancer and promoter, factor VIII production is not dependent 
on the specific transcriptional start signal but rather Is dependent on other parts of the control region such as 20 
the stabilizing sequence site in the vector. 

At the same time that ceils were transf ected to establish a production ceil line, a dish of each cell type was 
assayed for transient expression. Results of the transient expression screen for factor VIII produced two 
classes of cells: those cell types which stained positively for factor VIII thirty-six hours after transfection 
(Category 1); and. those cell types having no detectable transient expression of factor VIII (Category 2). The 25 
host cells comprising each category are indicated below: 



Cateeorv 1 






CHO 


MDCK 


30. 


293 


BRL 




BHK 


Hela 




TM4 


Vero 


35 


HTC 


V138 




COS 


CV1 




HepC2 




40 


TR1 







As discussed above deletion of the Ig variable region intron and donor and acceptor sites, while maintaining 
the other control regions, resulted in elimination of transient expression of factor VIII. From this data at least 
one splice donor-intron-acceptor sequence appears to be required for expression. 

Additional experiments indicate that location of the stabilizing sequence is important. For example location 
of an intron 3* to the cDNA encoding factor VIII failed to express factor VIII. Vectors which were constructed to 
include native factor VIII splice sites, i.e. splice sites within the coding region, also proved unsuccessful. The 
splice donor-acceptor arrangement containing the CMV spfice donor sequence and a chimeric intron 
comprising CMV sequences and the synthesized Ig variable region mtron and acceptor Is an example of a 
stabilizing sequence which win lead to the establishment of a cell line providing continuous production of 
factor VIII. 

Z Continuous Production 

Production cell tines were established by transf ec tin g me plasmids, containing a stabilizing sequence and 
shown to function transiently In a wide variety of ceils, into a number of cell lines. For these transf actions a total 
of 10 ug of DMA/ ml precipitate was used. For transfection of CHO DHFR cells 4 ug of factor VIII plasmid was 
added to 6 ug salmon sperm DMA, which served as a carrier. Wigler, M. at aL. supra . Direct selection for 
expression of DHFR gene was possible in these cells. All other cad types required cotransfectlon with a 
plasmid expressing neomycin gene. Davies, J. and Jenning, A., Am. J. Trap. Med. Hyg. 29(5). 1089-92) (1980). 
Ether pSVENeoBa16 (European Patent Publication No. 160,467) or pRSVneo (Gorman, et al. Science, 221^ 
551-653 [1983]) were used. For these transfections 4ug factor VIII plasmid 1 ug of neomycin containing 
plasmid and 5 ug salmon sperm carrier were used. All cells were transfected using modified CaP04 technique 
except for BRL as discussed above. Transfected cells Included: BHK, CHO-DHFR, CV1. Vero. WI38, 293. TM4, 



11 



0 260 148 



Hela, MOCK, HTC, BRL, TR1 and HepQ2. 

The protocol used to establish production lines relied heavily on the staining method described above. Two 
days following transaction cells were subcultured into selection media lacking glycine, hypoxanthine and 
thymidine for the CHO DHFR cells or G418 containing media. Levels of G418 were titrated for the proper 

5 amount needed for selection. 

Three to four weeks following the onset of selection, cells were screened for stable expression of factor vlll. 
A dish of clones was stained to determine the percentage of clones expressing factor VIII. Following this 
determination twelve clones of each cell type were picked for staining. Clones which scored positive at this 
time indicating stable expression were then also assayed quantitatively by Coatest assay. 

w Those cells which failed to demonstrate transient expression did not demonstrate stable expression at this 
time e.g. Vero, HeLa. Stable expression of factor Vlil was observed in two catagories of cells: 1) some cells 
which expressed factor vlil transiently scored negative during this first round of stable expression e.g. CHO 
and BHK cells failed to show stable expression levels high enough to stain; and 2) cell iines which stained 
positively at both transient and unamplified stages e.g. TM4. HTC and 293 which are then referred to as 

15 potential production cell lines. 



20 



25 







TM4 


CHO 


HTC 


BHK 


293 


Vero 




Hela 




CV1 




WI3B 




BRL 



Low levels of factor Vlil in CHO cells were assayed by coatest. Results of the coatest assay were as follows : 
Faetor VIII Levels in Unamplified Cells 
Host Cells mU/10 4 cells/day 



40 


TM4 


1.8 




HTC 


0.5 




293. 


2.5 


45 


CHO 


<0.15 




BHK 


<0.15 



The results of the foregoing assay demonstrate varying levels of factor Vin production within the class of 
production host cells. The results of Immunoperoxidase staining of transfected cells for transient expression 
at 3fr48 hours followed by staining of unamplified ceils three to four weeks thereafter was predictive for using 
a particular ceil type as a production cell for factor VIII. If the cells did not stain positively indicating an absence 
of factor VIII expression at the transient and unamplified levels that host cell is unlikely to serve as a production 
ceil line for factor VIII. This conclusion is supported by the results of staining CHO, BHK, KTC. TM4 and 233 
cells for factor Vlil expression after rounds of amplification. 

Following the first round of amplification (MTX'JOOnM) clones were again isolated from the foregoing 
transforming cell types and analyzed for production of factor Vlil. Results of factor VIII expression after this 
first round of amplification were as follows: 
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Factor VIII levels after One Round of Amplified aw 

4 

Host Cells n0/10 cells/day 

5 

3.0 
2.3 

0.1 mU/ml 

' w 

0.1 aU/ml 



TM4 
HTC 
GHO 
BHK 



Both clones and mass populations were kept for further amplification. Though factor VIII was monitored 75 
transiently in both CHO and BHK ceils, few clones ware identified after one round of MTX amplification and 
upon continued passage of cells the low levels of factor VIII were lost. Heterogeneity was seen wtth these 
clones. Fig. 4. CHO cells which make factor VIII had a greatly increased doubting time of 45-52 hrs and were 
overgrown by non-expressing cells in the population which have a doubting time of 28 firs. Careful study 
demonstrated that continuous CHO clones expressing factor VIII were difficult to establish. The data 20 
presented below shows the frequency of clones expressing factor VIII In DHFR positive CHO clones at both 
the unampltfied level and after one round of amplification. TM4 cells are shown for comparison. The number of 
clones which stained positive for factor VIII is given as a percentage of the number of stable clones obtained 
following transection. 

25 

1st Round 

Cell Type VsCtPr UnflTTlPUf ltd Amplification 



TM4 pFSCXS 18% 77% & 

pF8SCIS 15% 90% 

CHO pFBCIS 0 0.1% 

pFSSCIS 0 0.1% 35 



At the second or third round of amplification, usually approximately 1 uJd methotrexate, factor VIII was 
detectable in CHO cells. Even at this high level of amplification, activity of factor VIII was tow, 63 mU/ml. 
Continued amplification did not lead to increased production in CHO celt lines. Morphological analysis of three 40 
separately derived CHO lines show the celts staining for factor VIII to be enlarged in size and flattened. Fig. 5. 
Transformed CHO cells amplified to 10 \iM produced no more than 200 mU/ml. These results indicate that the 
choice of a host cell is an important step in the establishment of a production celt system for factor VIII. 
Presently TM4, HTC and 233 have been used to establish permanent ceil lines providing continuous 
production of factor VIII, thus qualifying as production cell lines. 



Example 3 



Coagulant Activity of Factor VIII 

The expression and secretion of active factor VIII from TM4 cells was determined by coagulation analysis. 50 
Serum free media that had been conditioned for 48 hours by TM4 cells transfected wtth pFSCIS was assayed 
for factor VIII. As shown In Table 1 TM4 culture media shortened the clotting time of hemophilic plasma. Most 
of this coagulant activity was neutralized when TM4 media was preincubated wtth a polyclonal antibody against 
human factor VIII. 
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5 


Sample 


Clot Time 


Units/ml 




pd VIII 


46.5 


0.5 


10 


pd VIII + factor VIII 
antibody 


91.5 


<0.01 




TM4 media 


54.2 


0.36 


15 


TM4 media + factor VIII 
antibody 


70.4 


0.05 



Table 1. Secretion of active factor VIII from TM4 celts. TM4 media was either prelncubated wtth 10 jig of a 
human factor VIII polyclonal antibody or with no addition for 30 minutes at 37° C. Subsequently the media was 
diluted 1 :1 wtth 0.05 M Trls pH 7.5 containing 0.01% BSA and assayed by coagulation analysis. Purified plasma 
derived factor VIH {pd VIII) was treated similarly. 

25 

Example 4 
Expression Vector 
30 Variant Factor VIII 

One approach to achieve a more efficient protein is protein engineering. That is, by Introducing changes 
within the gene at the DNA level, variants can be produced in cell culture to allow for specific modification in 
protein function. Three variants were engineered. The native factor VIII single chain 300,000 daiton protein is 
cleaved to subunrts of 90,000 and 80,000 daiton which in turn are cleaved to the active subunits of 50,000. 
35 43,000 and 73.000 daiton. The B domain between amino acid 742 through 1 648 has no defined function. Vehar, 
GA et al., Nature 312, 330-337 (1984). The same cell systems described for expression of the full length 
recombinant factor VIII protein were used to express the mutant. 



PF8CIS9080 

The eukaryotic expression vector used to express the factor VIII fusion protein included: the enhancer 
(Boshart et al., supra ), and promoter (Thomsen et al., supra ) of the human cytomegalovirus (CMV) Immediate 
early gene ; the splice donor sequence located 3' of the transcription initiation site of this gene (Boshart et al., 
supra, Stenberg et at., supra ); and a synthetic splice acceptor site from the mouse immunoglobulin variable 
region (Bothwell et aL, supra ). The new coding region is flanked on the 3' end by the SV40 early 
polyadenyiation sequence and transcription termination site (Rers et aK, supra ). The vector includes an 
amplifiable marker, the SV40-DHFR transcription unit. 

Construction of the expression vector, pF8CIS9080, encoding the factor VIII fusion protein 90kd + 142aa + 
80kd is shown in Rgure 6. Starting with the plasmid pSVE.8clD (European Patent Publication No. 160,457), a 
short deletion was made in the 3' untranslated region by cutting with Sstll, blunting the cohesive ends wtth S1, 
further cleaving wtth Hpai and rellgatlng the two blunt ends to generate pSVE.8c9. This plasmid was cleaved 
with Oal and Sail and the 10031bp fragment cloned in the Sail. Clal. A 6761 bp promoter containing fragment of 
pAML3P.D22 (European Patent Publication No. 160,457). V ? fusion in the factor VIII gene was made by 
ligattng the filled in Tth111 I and Bam HI (amino acid 1563) sites within the factor VIII gene. Figure 6 shows 
ligation of a 2516bp fragment of pAML3P.8ci (European Patent Publication No. 160,457) and a 11991bp 
fragment of pAMU3P.8c9 to construct pAML3P£L19 containing the fused region. This fusion was confirmed by 
DNA sequence. A 4722bp Clal-Xbal fragment containing the fusion region was cloned into a 5828bp Clat-Xbal 
fragment of pF8CIS containing the CMV promoter-enhancer expression vector. The CMV fragment was 
obtained from a dam- strain of E. coli where methytation does not prevent cutting at the Clal site. 

Example 5 



Expression Results 

The method described In Example 2 was applied to expression of the factor VIII variant which deleted 
nucleotides 796 through 1562, pF8CIS9080. The 90kd + 142aa + 80kd fusion protein Is expressed at higher 
levels than the full length protein. However there remains considerable variation between cell types as to the 
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capability of expressing the fusion protein. 

The following data demonstrates that the choice of a proper host cell will provide continuous production of 
the desired fusion protein in commercially useable quantities. TM4 celts transf ected with pF8ClS9080 showed 
both transient and stable expression of the fusion protein. TM4 cells transfected with pF8CIS9080 showed a 
five-fold increase in the levels of the fusion protein as compared to the full length factor VIII. At 100nM 5 
methotrexate pooled clones of the fusion factor VIII yielded 120111/10* cells/day. HTC cells showed a similar 
enhancement in expression of the fusion factor VIM as compared to the full length factor VIII. 

Expression of the fusion protein factor VIII is quite high In 293 cells as compared to full length factor VIII 
expression. In 293 cells transformed with the fusion protein vector pF8CIS9080 the unampllfled population 
levels of 85 mU/10 4 cells/day were routinely achieved. Expression levels of full length factor VIII were lower W 
than the fusion factor VIII yielding 2.5 mil/ 10* cells/day. Since the control signals are identical in the pF8ClS 
and pF8CIS9080. the difference in expression levels must lie within the capability of the cell to produce full 
length message and/or protein. 

The fusion protein was detected at an earlier point of amplification (100nMJ than the full length (1000 nM), 
however as shown In figure 7 these cells were burdened by the expression of the fusion protein. CHO cells 15 
expressing 90kd + H2aa + 80kd at 100 nM produced 0.5 u.U/10* cells/day. Continued amplification was 
difficult due to mixed population seen in figure 8. Clones selected at 1 |iM MTX and 5 uJvl IvTTX showed no 
higher expression levels than the 0.1 \iM MTX lines. In summary, certain host ceils were particularly adept at 
expression of factor VI II or its variants e.g. TM4. Other host ceils were of an intermediate nature In that the 
variant is expressed while the full length factor VII! Is expressed In tow levels or not at all e.g. 293 cells. A final 20 
group of host cells is unlikely to produce sufficient factor VIII for production, e.g. TRI. 

Example 6 

Purification and Characterization of Fusion Protein 25 

The 90kd + 142aa 4- 80kd fusion was purified from 293 media using techniques previously described for full 
length recombinant factor VIII, Eaton, D.E. et aJ., Biochemistry 25, 505-512 (1986). The purified fusion had a 
specific activity of 4,000-6,000 unlts/mg, which Is comparable to the specific activity of plasma derived factor 
VIII. When subjected to SDS-PAGE the fusion resolved Into two major bands with M r of 115.000 and 80,000. A 
band with a M, of 180,000 was also seen and probably represents the single chain form of the fusion. The M r 30 . 
180.000. 1 15,000, and 80,000 proteins were all detected by a factor VIII polyclonal antibody in a Western blot 
(Figure 11 J. 

Coagulant activity of the 90kd + 142aa +• 80kd fusion was activated 10-20 fold by thrombin (Figure 12). This 
activation correlated with the generation of subunits with M r 50,000, 43,000, and 73,000 (Id.). Since factor VIII 
circulates in plasma bound to von Willebrands factor (vWF), binding of the 90kd + 142aa + 80kd fusion to 35 
vWF was also tested. Purified 90kd + 142aa + BOkd fusion that was passed over a vWF-Sepharose column 
quantitatively bound to the column (Figure 13). Subsequently the fusion was eluted from the column with 0.25 
M CaCl 2t which Is known to dissociate factor VIII-vWF complexes. 

The above data show that the 90kd + 142aa + 80kd fusion expressed and secreted from the 293 cells was 
functionally similar to plasma derived factor VII I. The 90kd + 142aa + 80kd fusion shortened the clotting time 40 
of hemophilic plasma and its activity was neutralized by a factor VIM antibody. The fusion was activated and 
processed by thrombin similarly to plasma derived factor VIII. The 90kd + 142aa + BOkd fusion also bound to 
vWF immobilized on Sepharaose and was dissociated from vWF under conditions known to dissociate factor 
VIII-vWF complexes. 

45 

Example 7 

Coagulant Activity of Fusion Protein 

Serum free media that had been conditioned for 48 hrs. by 293 ceils transfected with pF8CIS9080 was 
assayed for factor VIII coagulant activity by coagulation analysis. After the media was diluted 1/50 It was 50 
assayed and found to shorten the clotting time of hemophilic plasma from 120 sec. to 58.9 sec, corresponding 
to 5.5 units/ml of factor VIII coagulant activity. This activity was neutralized by a polyclonal antibody against 
plasma derived factor VIII (Table 2). 
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5 Sample Cloc Tims Unlts/nl* 

(seconds) 





Buffer 


120.0 


<0.1 


10 


Diluted 293-media 


58.9 


5.5 


15 


Diluted 293-media 
Freincubeted vith 
factor VIII antibody 


118.0 


<0.1 




Undiluted 293-media 
(parent cell line) 


100.0 


<0.1 



* In undiluted media 



25 Table 2. Coagulant actMty in 293 media. 

Hemophilic plasma (50 uJ) was Incubated with 50 of pJatelin (General Diagnostics) for 8 min. at 37° C. 

Subsequently 50 \i\ of media that had been diluted 1/50 wtth 0.05 M Tris pH 7.4 buffer containing 0.01 percent 

BSA was added and incubated 30 sec. CaCb (25 mM), 50 uJ, was added and the clot time was measured. 

Media obtained from the parent cell line, which was not transfected, was not diluted. Antibody neutralization 
30 experiments were performed by preincubating undiluted media (100 ui) with 10 u.g of factor VIII polyclonal 

antibody for 30 min. at room temp. The media was then diluted and assayed. 

Example 8 

35 Expression Vector of Factor VM Variant Resistant to Activated Protein C 

Activated protein C (APC), a plasma protein, has been shown to inactivate human factor VIII by limited 
proteolysis. One possible site of this inactivatlon cleavage is at arginine at position 336. The arginine at position 
336 can be changed to another amino acid, for example, lysine or glutamic acid. Two vectors, pF8ClS336E and 
pF8C!S9080-336E, were constructed to determine whether position 336 was a site of inactivation. Using in 

40 vitro mutagenesis (Norrts, K. et al., Nucleic Acids Research, 11, 5103-5112 [1983]) the arginine at position 336 
was mutated to a glutamic acid (Fig. 9). For the mutagenesis a 792 bp HlndlH-Kpnl fragment from pFBCIS was 
inserted into the Hindlll-Kpnl sites of m13. The 18 bp oligomer shown below was used to mutagenize this 
fragment. 

45 

P Q L E M K H 

5* CC CAA CIA GAA ATG AAA A3' 

50 * 

Following strand extension the double stranded mutagenized M1 3 clone was cut with Acc l and Kpn i. A 778 
bp fragment was gel purified. The plasmid pFBCIS was grown in a dam- strain of E. coli, GM48. Due to the 
sequence of the Pstl-ClaJ linker shown in figure 1 , the ClaJ site of pF8CIS will not cut If the plasmid is grown in a 
55 methytation plus strain of bacteria as discussed above. Two fragments were Isolated from the dam- pF8CIS 
DNA, a 10kb Kpni partfal-Clal fragment and a 1 108 bp Clal-Acct fragment A three part ligation was required to 
replace the native factor VIII sequence with the mutagenized sequence. See figure 9. 

Construction of pF89080-336E proceeded via another three part ligation as shown In Figure 10. A 1 1 15 bp 
Spel-Bglll fragment containing the 336E variant amino acid was transferred to create another variant fusion 
60 protein by ligation to a 891 bp Sacll-Spel fragment and a 8564bp Bglll-Sacll fragment isolated from 
pF8CIS9080. 

Both of these protein variants were expressed In 293 cells. Full length factor VIII wtth this mutation was 
expressed at 2.8 mU/10 4 cells/day while the fusion variant was expressed at 15 mU/10 4 cells/day. 
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Example 9 

Activity of Factor VIII Resistant to Activated Protein C 

Media obtained from 293 cells transfected with pF8CIS-336E shortened the clotting time of hemophilic 
plasma. This activity was neutralized by a factor VIII polyclonal antibody (Table 3). Activated protein C, 5 
however, did not inactivate recombinant factor VIII containing a glutamic acid at position 336 (Table 3). 

Table 3 



10 



Staple 


Clot Time 


Units/ml 




293-336E 


70.4 


0.8 


15 


rVIII 


65.8 


1.1 




293-336E media + APC 


68.5 


0.9 


20 


rVIII ♦ APC 


85.0 


0.28 




293-336E media 
Preincubated with 
factor VIII polyclonal 


95.0 


<0.1 


25 



Table 3. Stability of full length factor V1II/336E to Activated Protein C. Serum free media that had been 
conditioned for 48 hrs by transfected 293 cells producing full length factor VI1I/336E (referred to as "293-336E" x 
in the table) was concentrated 27 fold. To 100 uJ aliquots of this media. 10 uJ of rabbit brain cephaJin and 10.0 
ng of APC was added. Controls received no APC. Samples were incubated for 40 min. at 37* C. Similarly, 
purified recombinant factor VIII containing the arginlne at position 336 (rVIII) was diluted to -1 units/mi with 

0. 05 M Tris, pH 7.5, 150 mM NaCI, 2.5 mM CaCIa and Incubated with rabbit brain cephalln and APC. Media from 
transfected 293 cells producing full length factor VIII/336E (26X) was also preincubated with a factor VIII ^ 
polyclonal antibody (1uJ of IgG prep) for 40 min at 37°C. At the end of Incubations, samples were assayed by 
coagulation analysis. 

Example 10 

40 

Expression Vector Prorelaxin 

1. pClHRX 

The Vector pCIHRX contained the cytomegalovirus enhancer and promoter, the cytomegalovirus splice 
donor site, the Ig variable region splice acceptor site, the cDNA encoding H2'preprorelaxin and the hepatitis ^ 
surface antigen pofyadenytation and transcription termination sites. Figure 14 shows the steps for 
construction of the prorelaxin vector. The same intron and splice acceptor sequence described previously 
from the Ig variable region was maintained. 677bp of the preprorelaxin cDNA followed these 5' processing 
signals. While the 5' control signals were identical to pFBCIS the polyadenylation region and termination 
sequence signals were from the hepatitis surface antigen gene rather than SV40. 50 

An Intermediate plasmid pClaRX was first constructed The plasmid pSVERX (see copending U.S. patent 
application U.S.S.N. 06/907,197, filed September 12, 1986, and corresponding European application) was cut 
with Hlndlll to isolate a 1700bp fragment containing the pre-prorelaxln cDNA followed by the hepatitis B 
surface antigen (HBsAg) 3* polyadenylation site. A Kprt site was 3 f to the HBsAg polyadenylation site and 5' to 
the start of the SV40 early promoter which in this vector was used to drive expression of the DHFR cDNA. 55 

This Hlndlll fragment was Inserted into pML linearized at the Hlndlll site. Reclosures were minimized by 
treatment with bacterial alkaline phosphatase (SAP). Ampiciilin resistant colonies were screened to Isolate 
clones which had inserted the pre-prorelaxin gene so that the 5' end of the gene was next to the Oal site of 
pML 

The intermediate plasmid pClARX was cut with Clal and Kpnl to isolate a 1360bp fragment containing the & 
pre-prorelaxln gene followed by the hepatitis surface antigen 3' polyadenylation sequences. This fragment was 
ligated to the 5143bp fragment created by cutting pFBCIS dam- with Clal and Kpn l. 
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Because the choice of polyadenylation sequences is known to influence 5' processing of messenger RNA 
(Wilson & Nevtns. supra ), the 3' hepatttis potyadenylation sequence in pCIHRX was replaced wtth the pSV40 
potyadenylation sequence. This construction was designated pCISRX. The two starting vectors for this 
5 construction are pCIHRX and pF8CIS. The latter vector has the same 5' controls as pCIHRX but includes the 
cDNA for factor VIII and the SV40 polyadenylation site. Sacll was used to cleave 3 / of the cONA. The resultant 3* 
overhang was blunted by T4 polymerase. pCIHRX was then cut with Bam Hl. This site separates the chimeric 
intron from the 5' end of the relaxin gene. An 861 bp fragment was gel isolated from the BamHl treatment The 
SV40 potyadenylation site, DHFR, transcription unit bacterial origin of replication and amp* gene, as well as the 
10 CMV enhancer and promoter and splice donor were isolated from pF8CtS. These elements were isolated in 
two fragments, as a 2525b p Sall- Bam Hl fragment and a HpaJ-SaJi 31 13 bp fragment. A three part ligation of the 
Bam HI-Sacll (blunted) fragment with the Hpa J- Sai l fragment and Sail to Bam Hl fragment yields pCISRX. 

Example 11 

15 

Expression Pro relaxin 

The expression capabilities of the two relaxin expression vectors pCIHRX and pCISRX, were assayed using 
several anfr-relaxin antibodies in the immunoperoxldase method described above. Three rabbit polyclonals 
and three mouse monoclonal antibodies were tested on COS cells transfected wtth pSVERX. One monoclonal 
20 RX-I was found to give Intense staining with no background. 

The two vectors of this invention, pCIHRX and pCISRX, were tested for prorelaxin expression and compared 
to pSVERX. pCIHRX and pCISRX vectors differed in the polyadenylation sequence. pCIHRX contained the 
hepatitis surface antigen polyadenylation sequence while pCISRX contained the SV40 early region 
potyadenylation sequence. 

25 293, TM4 and CHO cells were transfected with 10 u.g total DNA which included 1 \ig pRSVneo, 5 u.g salmon 
sperm carrier and 4 ug of plasmids pSVERX, pCIHRX and pCISRX. Cells were glycerol shocked as described 
above. Thirty-six hours following transf ection cells were fixed and stained with IH6 to identify transformed cells 
making prorelaxin. Positive staining cells were seen In 293 and TM4 cells transfected with pCIHRX and 
pCISRX. Duplicate plates of CHO, 293 and TM4 cells were split and subjected to the staining protocol 

30 described above to screen for prorelaxin production cells. 

Expression results are shown in the tables below indicating that the vectors containing the stabilizing 
sequence 5' of the DNA encoding prorelaxin produced significantly higher levels of prorelaxin than the 
reference plasmid, pSVERX. In the case of stable expression the media assay for prorelaxin was from the 
general population of cells. 
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Trnulcnt Srergfiatftn Prgrclwln 



Cell Type 

CHO 



flQSTDW 
pSVERX 
pCIHRX 
pCISRX 



Amount of Protein (ny/inn 
0.4 
0.9 
3 



10 



TM4 



pSVERX 
pCIHRX 
pCISRX 



0.4 
2 
10 



15 



293 



pSVERX 
pCIHRX 
pCISRX 



0.4 
3 
12 



20 



Stable Expression Prorelaxln 



CftU Type 
CHO 



pSVERX 
pCIHRX 
pCISRX 



Amount of Protein f^M) 
0.6 
0.8 
3.9 



30 



35 



293 



pSVERX 
pCIHRX 
pCISRX 



0.41 
3.0 
22.0 



40 



45 



Example 12 

Expression Vector t-PA 
1. pCIHt-PA 

The vector pCIHt-PA containing the cytomegalovirus enhancer and promoter, the cytomegalovirus splice 
donor stte and Intron, the Ig variable region splice acceptor site, the cONA encoding t-PA (Pennies et al M 
Nature 301. 214 (1983)} and the hepatitis surface antigen polyadenytation and transcription termination site 
was constructed. 

Figure 16 shows the steps for construction of the t-PA vector. 

The t-PA cDNA was first cloned Into pML to provide a Ctel stte at the 5' end of the gene. To do this a 3238 bp 
Hindlll fragment from pSVpa-DHFR (otherwise referred to as pETPFR In UK patent 2,119.804 B) was inserted 
into the Hindlll site of pML. Colonies were screened for clones which have the 5' end of the cONA juxtaposed 
to the Ctal site. The Intermediate plasmid iabeDed pCLAt-PA is shown In Figure 16. A t-PA cDNA followed by 
the 3' poryadenyiation region was isolated as a Clal-Kpnl fragment of 2870bp. This fragment was flgated to the 
5146bp fragment of pF8CIS. This Cial -Kpn l fragment of the CIS vector provided the 5' control region, a 
SV40-OHFR transcriptional unit, the ampicillin resistance gene and origin region from pML pCIHt-PA is 
analogous to pCIHRX, discussed above, with the exception of the cDNA cooing for the desired heterologous 
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Expression levels of t-PA were compared by transfecting CHO and 293 cells with pSVpaDHFR, pCMVt-PA 
and pCIHt-PA The former two vectors did not contain a stabilizing sequence and thus served as controls for 
the vector pCIHt-PA containing the cDNA encoding t-PA constructed in accord with the instant invention. 
5 Media from each of the cultured transformed 293 cells were assayed and the following results were obtained: 
pSVpaDHFR gave 30 ng/ml ; pCMVt-PA gave 200 ng/ml of t-PA; and pCIHt-PA gave 420 ng/ml of t-PA. 

2 pClSt-PA 

The vector pCISt-PA containing the cytomegalovirus enhancer and promoter, the cytomegalovirus splice 
10 donor site and intron, the !g variable region splice acceptor site, the cDNA encoding t-PA and the pSV40 
oolyadenylation sequence was constructed. 

The starting vectors for this construction are pCIHt-PA and pFBCIS (see Figure 20) . The latter vector has the 
same 5' controls as pCIHt-PA but includes the cDNA for factor vlll and the SV40 polyadenytation site. Sacll 
was used to cleave 3' of the t-PA cDNA. The resultant 3' overhang was blunted by T4 polymerase. pCIHt-PA 
15 was then cut wtth Clal. This site separates the chimeric Intron from the 5' end of the t-PA gene. A 2870bp 
fragment was gel isolated from the Clal treatment. The SV40 polyadenylation site, DHFR, transcription control, 
bacterial origin of replication and ampr gene, as well as the CMV enhancer and promoter and splice donor were 
isolated from pF8CIS. These elements were isolated into fragments as a 2525bp Sall-CIa} fragment and a 
Hpal-SaJI 31 13 fragment A three part ligation of the Sacll(blunt)-aal fragment wtth the Hpal-SaJt fragment and 
20 SaJt-Cial fragment yields pCISt-PA. 

"Expression levels of t-PA were compared by transfecting 293 and CHO cells with pCIHt-PA and pCISt-PA. 
Media from each of the cultured transformed cells were assayed and the following results were obtained: 

Transient 

26 (t-PA ng/ml) 

CHO 







CIS 


55 


30 




CIH 


15 








293 










CIS 


3000 






CIH 


1300 



40 

Claims 

1. A method for continuous production of a desired heterologous protein in a eukaryotic host ceil 
comprising: . 
45 a) constructing an expression vector having a sequence of double stranded DNA comprising the 

following elements: 

1) a stabilizing sequence downstream of a promoter and upstream of a DNA encoding the ammo acid 
sequence of the desired heterologous protein ; 

2) DNA encoding the amino acid sequence of the desired heterologous protein downstream of said 
50 stabilizing sequence; and, 

3) DNA coding a polyadenytation sequence downstream of which Is a transcnption termination site; 

b) transfecting and then choosing a eukaryotic host ceil with said expression vector; and 

c) culturtng the transacted eukaryotic host ceil under conditions favourable for continuous 
production of the desired protein. 

Z The method of claim 1 wherein the promoter is from the Immediate earty gene of human 
cytomegaiovirus. 

3. The method of claim 1 wherein the promoter is from simian virus 40 (SV40) . 

4. The method of any one of claims 1 , 2 and 3 wherein the stabilizing sequence comprises at least one 
but not more than two splice donor-intron-acceptor sequences. 

50 5. The method of claim 4 wherein the splice donor sequence of the splice donor-intron-acceptor 

sequence Is from the Immediate early gene of human cytomegalovirus. 

6. Tne method of claim 4 wherein the intron of the splice donor-intron-acceptor sequence is from tne 
human cytomegaiovirus and the immunoglobulin variable region. 

7. The method of claim 4 wherein the splice acceptor sequence of the splice donor-intron-acceptor 
55 sequence corresponds to the Immunoglobulin acceptor sequence. 
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8. The method of any one of claims 1. 2 and 3 wherein the stabilizing sequence comprises an 
engineered DMA coding a mflNA having the same properties as a mRNA which had been subject to 
spficlng but from which no nucleotide sequence had in fact been removed. 

9. The method of any one of the preceding claims wherein the ONA encoding the amino acid sequence 
of a heterologous protein encodes factor VIII. 

10. The method of any one of claims 1 to 8 wherein the DNA encoding the amino acid sequence of a 
heterologous protein encodes t-PA. 

1 1 . The method of any one of claims 1 to 8 wherein the DNA encoding the amino acid sequence of a 
heterologous protein encodes proretaxln. 

12. The method of any one of claims 1 to 8 wherein the ONA encoding the amino acid sequence of a 
heterologous protein encodes a variant of factor VIII. 

13. The method of claim 12 wherein the factor VIII Is resistant to cleavage by activated protein C. 

14. The method of any one of the preceding claims wherein the ONA coding the potyadenylatlon 
sequence is from simian virus *+Q (SV40). 

15. The method of any one of the preceding claims wherein me host ceti is mouse Sertoli cell (TM4). 

1 6. The method of any one of claims 1 to 1 4 wherein the host ceH Is hepatoma cell (HTC) . 

17. The method of any one of claims 1 to 14 wherein the host ceH Is human embryonic kidney cell (293). 

18. The method of any one of the preceding claims wherein the expression vector includes an enhancer. 

19. The method of claim 18 wherein the enhancer Is located upstream of the promoter. 

20. The method of claim 18 or claim 19 which includes an enhancer and promoter from simian virus 
(SV40). 

21. A vector suitable for continuous expression In a eukaryotlc host cell culture of a desired 
heterologous protein which vector has the features defined in any of the preceding claims. 

22. TM4 cells, HTC cells or 293 cells transformed with an expression vector having the features defined in 
any one of claims 1 to 20. 
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Bom HI \^ / 
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SV40 



DHFR 



HBsAg 3' 

So" pML 




^"R^pSVEBclDt- 00 ' 

3HBsAg\. //FBcDNA 
Hpol 



Replace HBsAg 3' 



3- port ligation: 

a. pUC.SV40 Hpol-EcoRI 143 bp 
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I8mer used for mutagenesis 
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KpN I partial -Cla 
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SV40 poly 
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Acct 
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3- part ligation: 
Accl -KpN I 
KpN partial - Cla I 
Cla I - Acc I 



m!3 clone 

pF8CIS 

pFSCIS 
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3- part ligation: 

Sacll-Spel pF8CIS9080 

Spel-Bglll pF8CIS336E 

Bglll-Sacll pF8CIS9080 

V 



Fig.10. 
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Hind III 
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Hind III 
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Hind III 



HBsAg 
Hindlll^r 1 
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Bam HI 



Sal I 
Hind III 

BAP treatment 

Isolate pML linear DNA 
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Hind III 



0260148 



Eco Rt Cla I 

.Mind! 



CMV 
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SVE 
Hind 111 
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BAP treatment 

Isolate 5143 bp fragment 
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DHFR 



|ligate 
CMV 
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DHFrVV JTEcoRI 
HBsAg 
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3-part ligation: 

Bam Hl-Sac II (blunt) pCIHRX 

Sal l-Bam HI pF8CIS 

Sal l-Hpa I pF8C!3 



Fig.15. 
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Hind III 
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0111 
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BA? treatment 
isolate linear DNA 
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Hind III 
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Sau 3A 
3' HBsAg 
KpNI 



pF8CIS IJf8cdna 

DHFR\ \ y/ 



f * KpN I 



SVE 



Hind 



KpNI SV40 
poly A 



Clal 
KpNI 

Isolate 2870 bp fragment 



Clal (Dm") 
KpN I 

BAP treatment 

Isolate 51 46 bp fragment 



Cla I Hind 111 
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tPA cDNA 
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3' HBsAg 
KpNI 
I 



DHFR 
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CMV 

pMLS ^^al 
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alul 
sacl 
hgiAI 
bspl286 
banll 

taql spel 
1 TTCGAGCTCG CCCGACATTG ATTATTGACT AGTTATTAAT AGTAATCAAT TACGGGGTCA 
AAGCTCGAGC GGGCTGTAAC TAATAACTGA TCAATAATTA TCATTAGTTA ATGCCCCAGT 
from pPMLCMV beginning to Hindlll, enhancers and promoter 

scrFI 
bgll bstNI 
sau96I 

thai haelll 
61 TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC TTACGGTAAA TGGCCCGCCT 
AATCAAGTAT CGGGTATATA CCTCAAGGCG CAATGTATTG AATGCCATTT ACCGGGCGGA 

ahall 
aatll 

121 GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT TCCCATAGTA 
CCGACTGGCG GGTTGCTGGG GGCGGGTAAC TGCAGTTATT ACTGCATACA AGGGTATCAT 

ahall 

aatll bgll 
181 ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA AACTGCCCAC 
TGCGGTTATC CCTGAAAGGT AACTGCAGTT ACCCACCTCA TAAATGCCA1 TTGACGGGTG 

ahall 

rsal ndel rsal aatll 

241 TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT 
AACCGTCATG TAGTTCACAT AGTATACGGT TCATGCGGGG GATAACTGCA GTTACTGCCA 

bgll 

sau96I scrFI nlalll 

haelll bstNI rsal rsal 

301 AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG 
TTTACCGGGC GGACCGTAAT ACGGGTCATG TACTGGAATA CCCTGAAAGG ATGAACCGTC 

nlalll 
styl sfaNI 

snaBI ncol hphl rsal 

361 TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC GG TTTTGGCA GTACATCAAT 
ATGTAGATGC ATAATCAGTA GCGATAATGG TACCACTACG CCAAAACCGT CATGTAGTTA 

ahall 

hinfl aatll 
421 GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC TCCACCCCAT TGACGTCAAT 
CCCGCACCTA TCGCCAAACT GAGTGCCCCT AAAGGTTCAG AGGTGGGGTA ACTGCAGTTA 



nlalV 
ban I 

481 GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC 
CCCTCAAACA AAACCGTGGT TTTAGTTGCC CTGAAAGGTT TTACAGCATT GTTGAGGCGG 
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alul 
sad 
hgiAI 
bspl286 

hgal rsal mnll banll 

541 CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTGGGAGG TCTATATAAG CAGAGCTCGT 
GGTAACTGCG TTTACCCGCC ATCCGCACAT GCCACCCTCC AGATATATTC GTCTCGAGCA 

scrFI 
sau3AI hgal 

dpnl bstNI ahall fokl mnll mboll 

601 TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT CCATAGAAGA 
AATCACTTGG CAGTCTAGCG GACCTCTGCG GTAGGTGCGA CAAAACTGGA GGTATCTTCT 
Begin RNA 

scrPI 

sau96I ncil 

avail haelll 
nlalV xmalll 
scrFI eael 
ncil fnu4HI 
mapl sau3AI mnll thai mspl 

hpall dpnl bgll sacll hpall thai hinfl 

661 CACCGGGACC GATCCAGCCT CCGCGGCCGG GAACGGTGCA TTGGAACGCG GATTCCCCGT 
GTGGCCCTGG CTAGGTCGGA GGCGCCGGCC CTTGCCACGT AACCTTGCGC CTAAGGGGCA 

bstXI 
sau96I 

rsal hinfl haelll styl 

721 GCCAAGAGTG ACGTAAGTAC CGCCTATAGA GTCTATAGGC CCACCCCCTT GGCTTCTTAT 
CGGTTCTCAC TGCATTCATG GCGGATATCT CAGATATCCG GGTGGGGGAA CCGAAGAATA 

hael 
eael 

sau3AI ball 
dpnl sau3AI 
xholl alul dpnl 

nlalV ddel mnll xholl 

bamHI rsal hindlll bglll haelll 

781 GCGACGGATC CCGTACTAAG CTTGAGGTGT GGCAGGCTTG AGATCTGGCC ATACACTTGA 
CGCTGCCTAG GGCATGATTC GAACTCCACA CCGTCCGAAC TCTAGACCGG TATGTGAACT 
IgE synthetic lOOmer 

fnu4HI 
bbvl 

fokl PStI 
841 GTGACAATGA CATCCACTTT GCCTTTCTCT CCACAGGTGT CCACTCCCAC GTCCAACTGC 
CACTGTTACT GTAGGTGAAA CGGAAAGAGA GGTGTCCACA GGTGAGGGTG CAGGTTGACG 

Pstl-Clal 
converter 



clal 
sau3AI 
dpnl 
pvul 

aluj taql taql 
901 AGCTCGGTTC GATCGATAA 
TCGAGCCAAG CTAGCTATT 



Fig.17(cont.) 
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Fig.19. 

alul 
sad 
hgiAI 
bspl286 
banll 

taql spel 
1 TTCGAGCTCG CCCGACATTG ATTATTGACT AGTTATTAAT AGTAATCAAT TACGGGGTCA 
AAGCTCGAGC GGGCTGTAAC TAATAACTGA TCAATAATTA TCATTAGTTA ATGCCCCAGT 
from pPMLCMV beginning to Hindlll, enhancers and promoter 

scrFI 
bgll bstNI 
sau96I 

thai haelll 
61 TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC TTACGGTAAA TGGCCCGCCT 
AATCAAGTAT CGGGTATATA CCTCAAGGCG CAATGTATTG AATGCCATTT ACCGGGCGGA 



ahall 
aatll 

121 GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA TGACGTATGT TCCCATAGTA 
CCGACTGGCG GGTTGCTGGG GGCGGGTAAC TGCAGTTATT ACTGCATACA AGGGTATCAT 



aha 1 1 

aatll bgll 
181 ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGAGT ATTTACGGTA AACTGCCCAC 
TGCGGTTATC CCTGAAAGGT AACTGCAGTT ACCCACCTCA TAAATGCCAT TTGACGGGTG 



ahall 

rsal ndel rsal aatll 

241 TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC CTATTGACGT CAATGACGGT 
AACCGTCATG TAGTTCACAT AGTATACGGT TCATGCGGGG GATAACTGCA GTTACTGCCA 



bgll 

' sau96I scrFI nlalll 

haelll bstNI rsal rsal 

301 AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT GGGACTTTCC TACTTGGCAG 
TTTACCGGGC GGACCGTAAT ACGGGTCATG TACTGGAATA CCCTGAAAGG ATGAACCGTC 



nlalll 
styl sfaNI 

snaBI ncol hphl rsal 

361 TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC GGTTTTGGCA GTACATCAAT 
ATGTAGATGC ATAATCAGTA GCGATAATGG TACCACTACG CCAAAACCGT CATGTAGTTA 
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ahalX 

hinfl aatll 
421 GGGCGTGGAT AGCGGT" r-L A CTCACGGGGA TTTCCAAGTC TCCACCCCAT TGACGTCAAT 
CCCGCACCTA TCGCCAAACT GAGTGCCCCT AAAGGTTCAG AGGTGGGGTA ACTGCAGTTA 



nlalV 
ban I 

481 GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA AATGTCGTAA CAACTCCGCC 
CCCTCAAACA AAACCGTGGT TTTAGTTGCC CTGAAAGGTT TTACAGCATT GTTGAGGCGG 

alul 
sacl 
hgiAI 
bspl286 

hgal rsal ranll banll 

541 CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTCGGAGG TCTATATAAG CAGAGCTCGT 
GGTAACTGCG TTTACCCGCC ATCCGCACAT GCCACCCTCC AGATATATTC GTCTCGAGCA 

SCrFI 
sau3AI hgal 

dpnl bstNI ahall fokl mnll mboll 

601 TTAGTGAACC GTCAGATCGC CTGGAGACGC CATCCACGCT GTTTTGACCT CCATAGAAGA 
AATCACTTGG CAGTCTAGCG GACCTCTGCG GTAGGTGCGA CAAAACTGGA GGTATCTTCT 
Begin RNA 

scrFI 

sau96I ncil 

avail haelll 
nlalV xmalll 
scrFI eael 
ncil fnu4HI 
mspl sau3AI mnll thai mspl 

hpall dpnl bgll sacll hpall hphl thai hinfl 

661 CACCGGGACC GATCCCAGCC TCCGCGGCCG GGAACGGTGA TTGGAACGCG GATTCCCCGT 
GTGGCCCTGG CTAGGGTCGG AGGCGCCGGC CCTTGCCACT AACCTTGCGC CTAAGGGGCA 

clal 

alul sau3AI 
fnu4HI dpnl 
bbvl mspl taql taql 
tthllll pstl hpall pvul 

721 GCCAAGAGTG ACGGTGTCCA CTCCCACGTC CAACTGCAGC TCCGGTTCGA TCGATAA 
CGGTTCTCAC TGCCACAGGT GAGGGTGCAG GTTGACGTCG AGGCCAAGCT AGCTATT 



Fig.19(cont.) 
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3-part ligation: 

Sacll(blunt)-Clal pCIHtPA 

Sal I -Cla I pFSCIS 

Sal l-Hpa I pFSCIS 



£MV sac II 

y Bam HI 
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Sail 
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tPA cDNA 
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poly A 



KpNI 



